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Abstract 

Background: Internalin A (In IA) facilitates the invasion of Listeria monocytogenes into a host cell. Some strains of 
Listeria monocytogenes express truncated forms of InIA, which reduces invasiveness. However, few virulence-related 
genes other than inIA have been analyzed in InlA-truncated strains. In the present study, we sequenced the draft 
genome of strain 36-25-1, an InlA-truncated strain, with pyrosequencing and compared 36 major virulence-related 
genes in this strain and a clinical wild-type strain. 

Results: Strain 36-25-1 possessed all of the virulence-related genes analyzed. Of the analyzed genes, only 4 genes 
(dltA, gtcA, iap, and inIA) differed when the nucleotide sequences of strain 36-25-1 and the clinical wild-type strain 
were compared. Analysis of the deduced amino acid sequences found no mutations that significantly influenced 
virulence in genes other than inIA. 

Conclusions: The virulence-associated genes in strain 36-25-1 differ little from those of the clinical wild-type strain, 
indicating that a slight mutation in the nucleotide sequence determines the virulence of the InlA-truncated strain. 
In addition, the results suggest that, aside from InlA-mediated cell invasiveness, there is almost no difference 
between the virulence of strain 36-25-1 and that of the clinical wild-type strain. 
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Background 

Listeria monocytogenes is a gram-positive, non-sporulating 
foodborne infectious pathogen. The pathogen can cause 
serious diseases, such as septicemia and meningitis, espe- 
cially in high-risk groups (pregnant women, neonates, and 
immunocompromised people) with a high mortality rate 
of 20-30% [1]. L. monocytogenes is ubiquitous in nature; it 
can survive under conditions of high salt and low pH. 
Because, it can grow even at low temperatures, the bacter- 
ium can be found in many kinds of foods during storage 
[2]. In particular, ready-to-eat (RTE) foods, which do not 
require heat cooking, are a main source of foodborne lis- 
teriosis cases [3-5]. 
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Internalin A (InIA) plays an important role in L. monocy- 
togenes invasion by attaching to host cells [6-9]. However, 
some L. monocytogenes strains express a truncated form of 
InIA, which lowers the invasion rate [10-14]. Truncated 
InIA, caused by a premature stop codon (PMSC) in inIA, 
often lacks the LPXTG motif that anchors InIA to the sur- 
face of the pathogen, leading to a decrease in invasiveness 
[12,13]. Some reports have shown that InlA-truncated 
strains account for 35-45% of L. monocytogenes in RTE 
foods [15,16]. However, aside from invasiveness, the viru- 
lence of these mutated strains has not been studied. 

Our previous study showed that an InlA-truncated 
strain had wild-type PrfA, which regulates the expres- 
sion of virulence related genes [11]. On the other hand, 
Temoin et al. (2008) reported that all of 5 InlA-truncated 
strains analyzed had the same amino acid sequence muta- 
tions in the migration factor plcA and the invasion factor 
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MB [17]. However, approximately 50 genes are related to 
the L. monocytogenes infection cycle [18], and most of 
them have not been investigated in strains with trun- 
cated InlA. A small number of studies have investigated 
virulence-related genes in InlA-truncated strains [11,17]; 
the studies do not completely explain the virulence of the 
strains. 

This study aimed to identify the major virulence- 
related gene sequences present in InlA-truncated strains. 
In recent years, the analysis of bacterial whole genomes 
has become faster and easier with the development of 
next-generation sequencing methods such as pyrosequenc- 
ing. In this study, we used pyrosequencing to construct a 
draft sequence of strain 36-25-1, and we compared 36 
main virulence-related genes in the InlA-truncated strain 
and a clinical wild-type strain. 

Results 

Presence of virulence-related genes 

After de novo assembly of the reads for strain 36-25-1, 
the total contig length was 2,957,538 bp with a peak 
depth of 11.0. The contigs aligned to 2,861,194 bp of the 
EGDe whole genome sequence, and showed 99.84% 
identity (Table 1). 

In strain 36-25-1, 36 open reading frames (ORFs) 
showed high similarity with the 36 EGDe virulence- 
related genes, indicating that strain 36-25-1 has all of 
these genes. 

Comparison of the nucleotide and amino acid sequences 
of the virulence-related genes 

Nucleotide mutations were found in 4 genes {dltA, gtcA, 
MA,and iap) of strain 36-25-1, when compared to EGDe 
(Table 2). Substitutions of 1 bp were found in dltA, gtcA, 
and MA. The mutation in iap was an insertion of 12 bp 
(Figure 1). 

In dltA, thymine was changed to cytosine at position 891 
of the ORF. This mutation is a silent mutation, which does 
not cause an amino acid sequence change (Figure 1A). On 
the other hand, the mutation in gtcA is a missense muta- 
tion, which affects the amino acid sequence; substitution of 
thymine with cytosine position 200 changed the phenyl- 
alanine in EGDe to serine in strain 36-25-1 (Figure IB). 
The inserted region in iap is a tandem repeat sequence. 
Whereas EGDe has 5 repeats of the ACAAAT motif, strain 
36-25-1 has 7 repeats, resulting in 2 additional threonine- 



Table 1 The draft sequences results 





36-25-1 




EGDe 


Total bases (bp) 


2,957,538 (11.0*) 




2,944,528 


Alignment length (bp) 


2,86 


1,194 




Similarity 


99.E 


S4 % 




The number in the parentheses 


shows the peak depth of de novo 





assembly results. 



asparagine (TN) repeats (Figure 1C). Among the genes 
analyzed, a nonsense mutation was found only in MA 
(Additional file 1). 

Mutation of virulence-related genes in other InlA- 
truncated strains 

The 4 genes, in which the nucleotide sequences differed 
between strain 36-25-1 and EGDe were also sequenced 
in other InlA-truncated strains (Lmal3, Lmal5, Lma20, 
and Lma28). The silent mutation in dltA was found in 
all of the InlA-truncated strains (Figure 1A). On the 
other hand, the missense mutation in gtcA found in 36- 
25-1 was not found in the other InlA-truncated strains 
(Figure IB). As for the tandem repeat of the ACAAAT 
motif in iap, strains Lmal3, Lmal5, and Lma20 had 3 
repeats; strain Lma28 had 2 repeats (Figure 1C). 

Discussion 

Virulence-related genes in strain 36-25-1 

The contig of the 36-25-1 strain constructed by de novo as- 
sembly showed similarity as high as 99.84% in the regions 
that aligned with the EGDe strain. In addition, this strain 
possessed all of the 36 virulence-associated genes analyzed. 
The genus Listeria is considered to have lost virulence- 
associated genes as it differentiated from ancestors that 
showed virulence [19]. Multiple virulence-associated genes 
are missing in strain 4a, a serotype of L. monocytogenes 
showing no virulence [20]. Because strain 36-25-1 pos- 
sesses all of the 36 genes investigated in the present study, 
we conclude that the InlA-truncated strain has not under- 
gone changes that have resulted in any major loss of re- 
gions present in the clinical wild-type strain. 

Virulence-related genes with mutations 

Among the 36 virulence-associated genes in strain 36- 
25-1, 32 genes possess a sequence identical to that of 
the corresponding gene in the EGDe strain. Therefore, 
we conclude that the virulence of these genes is the 
same in the 36-25-1 and EGDe strains. Nucleotide se- 
quence differences were found in only 4 genes (dltA, 
gtcA, iap, and MA). 

dltA is a part of the dlt operon, which is composed of 
4 genes that function in the addition of alanine to lipo- 
teichoic acid (LTA) [21]. Experiments using a strain in 
which dltA was artificially inactivated suggest that dltA 
influences the electric charge of the bacterial surface to 
increase adhesiveness to host cells [22]. The dltA mutation 
found in strain 36-25-1 is present in all other InlA- 
truncated strains examined in this study. Whether or not 
this mutation is characteristic of InlA-truncated strains re- 
quires investigation of other clinical wild-type strains. 
However, this mutation does not influence the phenotype 
of these strains as a silent mutation. 
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Table 2 The alignment results of 36-25-1 and EGDe 



Gene Mutation type Mutation loci EGDe allelic type* 36-25-1 allelic type* Function of mutation loci 
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"in the allelic type columns, the amino acid residues are described in the parenthesis. 


Similar to dltA, gtcA 


is involved in the addition of a 


The mutation in iap is in the tandem repeat region, in 


saccharide 


to LTA [21]. In the present study, the nucleo- 


which the number of repeats has been reported to vary 


tide sequence of gtcA 


in strain 36-25-1 differed from 


even among clinical wild-type strains [23-26] . p60, encoded 


that in the EGDe strain, and the encoded amino acid se- 


by iap, is involved in the movement of L. monocytogenes 


quence differed as well. However, this mutation is not 


inside a host cell and in cell-to-cell propagation [24]. p60 


common 


to InlA-truncated strains: the mutation was 


possesses multiple LysM motifs at its C-terminus, which 


not found 


in the other InlA-truncated strains examined. 


are used to bind to the cell wall of L. monocytogenes, and 
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EGDe dltA 
36-25-1 dltA 
Lmal3 dltA 
Lmal5 dltA 
Lma20 dltA 
Lma28 dltA 

B 

EGDe gtcA 
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Figure 1 The alignment of mutation loci in EGDe and InlA-truncated strains. Nucleotide sequences and amino acid sequences are shown 
for each strain. The numbers shown on the both sides mean the nucleotide sequence positions in the ORF of strain EGDe. The frames show 
identical sequences among the strains. (A) The alignment dltA. (B) The alignment gtcA. (C) The alignment iap. 



an SH3 structure at the N-terminus exposed outside the 
cell [27,28]. The tap mutation in strain 36-25-1 occurred 
between the LysM domain and the SH3 structure. There- 
fore, we conclude that this mutation has almost no influ- 
ence on the functions of these domains. 

Pathogenicity of InIA truncated strain 

The results of the present study indicate that, except for 
InlA-mediated cell invasiveness, virulence in the InlA- 



truncated 36-25-1 strain is almost equivalent to that in a 
clinical wild-type strain. However, these results differ 
from those of a study by Temoin et al. (2008), which dem- 
onstrated that 3 virulence genes, including inlA, were sim- 
ultaneously truncated [17]. In addition, Van Stelten et al. 
(2011), after orally administering an InlA-truncated strain 
to guinea pigs, reported that only the translocation rate to 
the spleen was lower in the truncated strain, when com- 
pared to a clinical wild-type strain [14]. Moreover, in the 
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study by Olier et al. (2005) using a strain that originally 
showed virulence but was genetically modified to express 
truncated InlA, the mortality of chicken embryos infected 
with the transformed strain was lower than that of chick- 
ens infected with a clinical wild-type strain [13]. These re- 
ports indicate that aspects of virulence other than cell 
invasiveness differ between InlA-truncated strains and 
clinical wild-type strains; this cannot be explained by the 
results of the present study. Therefore, we expect that mu- 
tations in the virulence-associated genes of InlA-truncated 
strains will exhibit heterogeneity. The results from InlA- 
truncated strains other than strain 36-25-1 in the present 
study support this hypothesis. In addition, many L. mono- 
cytogenes genes have functions that are not yet elucidated. 
Hence, we cannot exclude the possibility that genes not 
analyzed in the present study contribute to the differences 
between InlA-truncated strains and clinical wild-type 
strains. Analyzing other InlA-truncated strains and deter- 
mining the unknown functions of L. monocytogenes genes 
will resolve these questions. 

Conclusions 

In the present study, we analyzed the major virulence- 
associated genes in strain 36-25-1, an InlA-truncated strain. 
With the exception of inlA, the virulence-associated genes 
in the InlA-truncated strain are almost identical to 
those in a clinical wild-type strain. The results indicate 
that a slight mutation in the nucleotide sequence, such 
as a PMSC, determines the virulence of InlA-truncated 
strains. In addition, post-translational analysis of each 
gene indicated that, except for InlA-mediated cell invasive- 
ness, the virulence of the 36-25-1 strain is equivalent to 
that of clinical wild-type strains. However, this result does 
not completely explain the results of previous studies on 
InlA-truncated strains. To clarify the relationship between 
virulence and genes in InlA-truncated strains, analyses of 
more InlA-truncated strains, including pyrosequencing, 
and studies of gene expression are required to understand 
the virulence of InlA-truncated strains deeper. 

Methods 

Bacterial strains used in this study 

L. monocytogenes strain 36-25-1, with truncated InlA, was 
sequenced by whole genome shot gun sequencing to 
analyze virulence-related genes. The low invasiveness of 
the strain compared to that of the wild-type strain was 
shown in our previous study [11]. 



In addition, four InlA-truncated strains (Lmal3, Lmal5, 
Lma20, and Lma28) isolated from raw meat products were 
sequenced by Sanger sequencing for reference [29] . 

The whole genome sequence of EGDe, a clinical wild- 
type strain, was obtained from GenBank (GenBank ac- 
cession no. NC 003210). 

Genome extraction 

All L. monocytogenes strains were cultured overnight in 
brain heart infusion broth (Eiken Chemical, Tokyo, Japan) 
at 37°C. The bacterial DNA was extracted using the 
phenol-chloroform and ethanol precipitation method [30] . 
One milliliter of enriched culture was centrifuged at 
10,000 xg for 10 min, and bacterial cells were incubated 
in 567 uL of Tris-EDTA buffer containing lysozyme 
(2 mg/mL) for 1 h at 37°C. Cells were lysed by the addition 
of 30 uL of 10% (wt/vol) sodium dodecyl sulfate and 3 mL 
of 20 mg/mL proteinase K, with incubation for 1 h at 
37°C. Next, 100 uL of 5 M NaCl was added, and DNA was 
extracted with chloroform-isoamyl alcohol (24:1) followed 
by phenol-chloroform-isoamyl alcohol (25:24:1). DNA 
was then precipitated with isopropanol, washed with 70% 
ethanol, and dried. Purified DNA was dissolved in Tris- 
EDTA buffer and used as the DNA template for whole 
genome shot gun sequencing and Sanger sequencing. 

Whole genome shot gun sequencing and de novo 
assembly 

For whole genome shot gun sequencing, a Roche GS 
Junior platform (Roche, Basel, Schweiz) was employed 
using a GS Junior Rapid Library Preparation kit and GS 
Junior emPCR kit (Lib-L) according to the manufacture's 
protocol. The read sequences were used to construct a 
contig without a reference sequence by de novo assembly 
using the GS De Novo Assembler (Roche, Basel, Schweiz). 
In this assembly, the program parameters were set to: seed 
step, 12; seed length, 16; seed count, 1; minimum overlap, 
10; and minimum identity, 90. 

Extraction of virulence-related gene loci and comparison 
analysis 

The contigs of strain 36-25-1 and the EGDe whole gen- 
ome sequence were aligned using NUCmer, an applica- 
tion of MUMmer 3.0 (http://mummer.sourceforge.net/). 

The virulence-related gene loci of strain 36-25-1 were 
extracted from the contigs using GenomeTraveler (In 
Silico Biology, Kanagawa, Japan). Briefly, among the 



Table 3 The primers used in PCR amplification and sequencing for confirmation of the mutation 

Gene Forward Reverse Annealing temperature 

dItA AAGTAGTGCAGTTTAGGAGAGGA AGATTGTACCACCGGATGTC 58.0 

gtcA TTGAGCTCTTAGTAGAACCTGAC CTGGTTTCGCTATCTCATTAG 54.5 

tap CAAAATGCTACTACACACGCT GTCAAAGAATACTAAATCACCAGC 56.5 
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ORFs extracted from the contigs, those that showed 
high identity with EGDe virulence-related genes were 
selected for further analysis. 

The extracted gene sequences were aligned with the 
EGDe sequences by GENETYX verll.0.0 (Genetyx, Tokyo, 
Japan) to identify nucleotide mutations. When a genomic 
mutation was found, the corresponding amino acid se- 
quences were also compared. 

Sanger sequencing at the mutation loci 

When gene mutations were identified in strain 36-25-1, 
Sanger sequencing was conducted for confirmation. 
Other InlA-truncated strains (Lmal3, Lmal5, Lma20, 
and Lma28) were sequenced at the same loci. The 
primers for PCR amplification and Sanger sequencing 
were designed using ABI PRISM Primer Express v2.0.0 
(Life Technologies, Carlsbad, CA, USA) (Table 3). The 
PCR reaction mixture, prepared in 50 uL, contained 
10 mM Tris-HCl (pH 8.3), 50 mM KC1, 1.5 mM MgCl 2 , 
50 pmol of each primer, 2.5 mM of each dNTP, 25 ng 
template DNA, and 1.25 U Takara Taq DNA polymerase 
(Takara Bio, Shiga, Japan). The PCR program consisted 
of 94°C for 4 min; 30 cycles of 94°C for 30 sec, annealing 
temperature for 30 sec, and 72°C 2 min; and 72°C for 
4 min (Table 3). The PCR products were purified using 
Agencourt AMPure (Beckman Coulter, Brea, CA, USA). 
Sanger sequencing was conducted using an Applied 
Biosystems 3130 Genetic Analyzer (Life Technologies, 
Carlsbad, CA, USA) using a Big-Dye Terminator ver3.1 
Cycle Sequencing Kit (Life Technologies, Carlsbad, CA, 
USA) and Agencourt CleanSEQ (Beckman Coulter, Brea, 
CA, USA), according to each manufacturer's protocol. 

Availability of supporting data 

The draft genome sequences of L. monocytogenes strain 
36-25-1 are available in DDBJ/EMBL/GenBank under 
accession number BASN01000001-BASN01000122. The 
gene sequences of other strains Lmal3, Lmal5, Lma20 
and Lma28 are available under accession number 
AB845328-845343. 
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Additional file 1: The alignment of inIA in EGDe and InIA truncated 
strains. Nucleotide sequences and amino acid sequences are shown for 
each strain. The numbers shown on the both sides mean the nucleotide 
sequence positions in the ORF of strain EGDe. The frames show identical 
sequences among the strains. 
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